video lecture
Transforming Higher Education with AI-Powered Video Lectures
The integration of artificial intelligence (AI) into video lecture production has the potential to transform higher education by streamlining content creation and enhancing accessibility. This paper investigates a semi -automated workflow that combines Google Gemini for script generation, Amazon Polly for voice synthesis, and Microsoft PowerPoint for video assembly. Unlike fully automated text -to -video platforms, this hybrid approach preserves pedagogical intent while ensuring script -slide synchronization, narrative coherence, and customization. Case studies demonstrate the effectiveness of Gemini in generating accurate and context - sensitive scripts for visually rich academic presentations, while Polly provides natural - sounding narration with controllable pac ing. A two-course pilot study was conducted to evaluate AI -generated instructional videos (AIIV) against human instructional videos (HIV). Both qualitative and quantitative results indicate that AIIVs are comparable to HIVs in terms of learning outcomes, w ith students reporting high levels of clarity, coherence, and usability. However, limitations remain, particularly regarding audio quality and the absence of human - like avatars. The findings suggest that AI - assisted video production can reduce instructor workload, improve scalability, and deliver effective learning resources, while future improvements in synthetic voices and avatars may further enhance learner engagement.
Generating Narrated Lecture Videos from Slides with Synchronized Highlights
Turning static slides into engaging video lectures takes considerable time and effort, requiring presenters to record explanations and visually guide their audience through the material. We introduce an end-to-end system designed to automate this process entirely. Given a slide deck, this system synthesizes a video lecture featuring AI-generated narration synchronized precisely with dynamic visual highlights. These highlights automatically draw attention to the specific concept being discussed, much like an effective presenter would. The core technical contribution is a novel highlight alignment module. This module accurately maps spoken phrases to locations on a given slide using diverse strategies (e.g., Levenshtein distance, LLM-based semantic analysis) at selectable granularities (line or word level) and utilizes timestamp-providing Text-to-Speech (TTS) for timing synchronization. We demonstrate the system's effectiveness through a technical evaluation using a manually annotated slide dataset with 1000 samples, finding that LLM-based alignment achieves high location accuracy (F1 > 92%), significantly outperforming simpler methods, especially on complex, math-heavy content. Furthermore, the calculated generation cost averages under $1 per hour of video, offering potential savings of two orders of magnitude compared to conservative estimates of manual production costs. This combination of high accuracy and extremely low cost positions this approach as a practical and scalable tool for transforming static slides into effective, visually-guided video lectures.
Shiksha: A Technical Domain focused Translation Dataset and Model for Indian Languages
Joglekar, Advait, Umesh, Srinivasan
Neural Machine Translation (NMT) models are typically trained on datasets with limited exposure to Scientific, Technical and Educational domains. Translation models thus, in general, struggle with tasks that involve scientific understanding or technical jargon. Their performance is found to be even worse for low-resource Indian languages. Finding a translation dataset that tends to these domains in particular, poses a difficult challenge. In this paper, we address this by creating a multilingual parallel corpus containing more than 2.8 million rows of English-to-Indic and Indic-to-Indic high-quality translation pairs across 8 Indian languages. We achieve this by bitext mining human-translated transcriptions of NPTEL video lectures. We also finetune and evaluate NMT models using this corpus and surpass all other publicly available models at in-domain tasks. We also demonstrate the potential for generalizing to out-of-domain translation tasks by improving the baseline by over 2 BLEU on average for these Indian languages on the Flores+ benchmark. We are pleased to release our model and dataset via this link: https://huggingface.co/SPRINGLab.
Transition-Aware Multi-Activity Knowledge Tracing
Zhao, Siqian, Wang, Chunpai, Sahebi, Shaghayegh
Accurate modeling of student knowledge is essential for large-scale online learning systems that are increasingly used for student training. Knowledge tracing aims to model student knowledge state given the student's sequence of learning activities. Modern Knowledge tracing (KT) is usually formulated as a supervised sequence learning problem to predict students' future practice performance according to their past observed practice scores by summarizing student knowledge state as a set of evolving hidden variables. Because of this formulation, many current KT solutions are not fit for modeling student learning from non-assessed learning activities with no explicit feedback or score observation (e.g., watching video lectures that are not graded). Additionally, these models cannot explicitly represent the dynamics of knowledge transfer among different learning activities, particularly between the assessed (e.g., quizzes) and non-assessed (e.g., video lectures) learning activities. In this paper, we propose Transition-Aware Multi-activity Knowledge Tracing (TAMKOT), which models knowledge transfer between learning materials, in addition to student knowledge, when students transition between and within assessed and non-assessed learning materials. TAMKOT is formulated as a deep recurrent multi-activity learning model that explicitly learns knowledge transfer by activating and learning a set of knowledge transfer matrices, one for each transition type between student activities. Accordingly, our model allows for representing each material type in a different yet transferrable latent space while maintaining student knowledge in a shared space. We evaluate our model on three real-world publicly available datasets and demonstrate TAMKOT's capability in predicting student performance and modeling knowledge transfer.
Deep Learning for Healthcare
This course is intended for persons involved in machine learning who are interested in medical applications, or vice versa, medical professionals who are interested in the methods modern computer science has to offer to their field. We will cover health data analysis, different types of neural networks, as well as training and application of neural networks applied on real-world medical scenarios. We cover deep learning (DL) methods, healthcare data and applications using DL methods. The courses include activities such as video lectures, self guided programming labs, homework assignments (both written and programming), and a large project. The first phase of the course will include video lectures on different DL and health applications topics, self-guided labs and multiple homework assignments.
Can Population-based Engagement Improve Personalisation? A Novel Dataset and Experiments
Bulathwela, Sahan, Verma, Meghana, Perez-Ortiz, Maria, Yilmaz, Emine, Shawe-Taylor, John
This work explores how population-based engagement prediction can address cold-start at scale in large learning resource collections. The paper introduces i) VLE, a novel dataset that consists of content and video based features extracted from publicly available scientific video lectures coupled with implicit and explicit signals related to learner engagement, ii) two standard tasks related to predicting and ranking context-agnostic engagement in video lectures with preliminary baselines and iii) a set of experiments that validate the usefulness of the proposed dataset. Our experimental results indicate that the newly proposed VLE dataset leads to building context-agnostic engagement prediction models that are significantly performant than ones based on previous datasets, mainly attributing to the increase of training examples. VLE dataset's suitability in building models towards Computer Science/ Artificial Intelligence education focused on e-learning/ MOOC use-cases is also evidenced. Further experiments in combining the built model with a personalising algorithm show promising improvements in addressing the cold-start problem encountered in educational recommenders. This is the largest and most diverse publicly available dataset to our knowledge that deals with learner engagement prediction tasks. The dataset, helper tools, descriptive statistics and example code snippets are available publicly.
Digital Twins and Artificial Intelligence as Pillars of Personalized Learning Models
Modern educational systems have not really evolved enough to meet the needs of modern students.21 No wonder, the percentage of dropouts from university studies is quite high (40% in the U.S. and 10% in Europe7,9). The university student profile has changed over the years. While yesterday's students were mainly full-time, today's students face challenges such as work commitments, family obligations, financial constraints, physical impairments, and learning models that do not adequately engage students or help them understand core concepts.11 One might think that this issue concerns only those who fail to complete their studies, but this is view is shortsighted. Today's educational system deficiencies will affect the welfare of tomorrow's society. To improve current learning models, academic institutions around the world agree that the time has come to improve the world of education, moving from a traditional approach--where learning is standardized and available only to those with access to educational buildings--to a new paradigm that enables students to personalize their educational pathway, so they can progress at their own pace.19,21
Top 3 Free Resources to Learn Linear Algebra for Machine Learning - KDnuggets
Mathematics is the core of all machine learning algorithms. And while it isn't a prerequisite to have formal math education in order to become a data scientist, you need to understand the principles of the subject well enough to successfully build models that add value. In an article I wrote previously, I explained the three branches of mathematics that were essential to gain a deeper understanding of ML algorithms -- statistics, calculus, and linear algebra. This article will solely focus on linear algebra, as it forms the backbone of machine learning model implementation. Linear algebra concepts like vectorization allow for faster computation speeds, and are implemented in libraries like Pandas, Scipy, and Scikit-Learn.
Top 20 Free Online Courses For Python Beginners
Python is an ideal first programming language for anyone interested in coding. Here are the top 20 Free Online Courses for Python from Udemy we've curated to help you learn Python. In this post you'll find 20 good beginners Python courses you can learn from and start your career as a software developer or web developer. All courses are free and you'll have lifetime access to the material! What better way to learn a new programming language than to dive right in? Python may be a general-purpose programming language, but it has specialized libraries that lend themselves to machine learning, artificial intelligence (AI), and scientific computing.
GitHub - lohitakshnandan/Amazing-AI-Resources: A curated list of Artificial Intelligence (AI) courses, books, video lectures,...etc.
Artificial Intelligence (AI) refers to the simulation of human intelligence in machines that are programmed to think like humans and mimic their actions. The term may also be applied to any machine that exhibits traits associated with a human mind such as learning and problem-solving. Artificial Intelligence is advancing by leaps and bounds. Recent research in the fields of Data Science, Machine Learning, Natural Language Processing and other sub fields of AI has already started to impact the lives of common people. AI is no more a superficial concept.